NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

The detection of automatic behavior in other people

https://doi.org/10.1037/amp0001440

Ullman, Tomer D; Bass, Ilona (December 2024, American Psychologist)

Full Text Available
Intuitive physics as probabilistic inference

Smith, Kevin A; Hamrick, Jessica B; Sanborn, Adam N; Battaglia, Peter W; Gerstenberg, Tobias; Ullman, Tomer D; Tenenbaum, Joshua B (November 2024, MIT Press)
Griffiths, Thomas L; Chater, Nick; Tenenbaum, Joshua T (Ed.)
Full Text Available
Teaching Without Thinking: Negative Evaluations of Rote Pedagogy

https://doi.org/10.1111/cogs.13470

Bass, Ilona; Espinoza, Cristian; Bonawitz, Elizabeth; Ullman, Tomer D (June 2024, Cognitive Science)

When people make decisions, they act in a way that is either automatic (“rote”), or more thoughtful (“reflective”). But do people notice when others are behaving in a rote way, and do they care? We examine the detection of rote behavior and its consequences in U.S. adults, focusing specifically on pedagogy and learning. We establish repetitiveness as a cue for rote behavior (Experiment 1), and find that rote people are seen as worse teachers (Experiment 2). We also find that the more a person's feedback seems similar across groups (indicating greater rote‐ness), the more negatively their teaching is evaluated (Experiment 3). A word‐embedding analysis of an open‐response task shows people naturally cluster rote and reflective teachers into different semantic categories (Experiment 4). We also show that repetitiveness can be decoupled from perceptions of rote‐ness given contextual explanation (Experiment 5). Finally, we establish two additional cues to rote behavior that can be tied to quality of teaching (Experiment 6). These results empirically show that people detect and care about scripted behaviors in pedagogy, and suggest an important extension to formal frameworks of social reasoning.
more » « less
Full Text Available
An approximate representation of objects underlies physical reasoning.

https://doi.org/10.1037/xge0001439

Li, Yichen; Wang, YingQiao; Boger, Tal; Smith, Kevin A.; Gershman, Samuel J.; Ullman, Tomer D. (June 2023, Journal of Experimental Psychology: General)

Full Text Available
Partial mental simulation explains fallacies in physical reasoning

https://doi.org/10.1080/02643294.2022.2083950

Bass, Ilona; Smith, Kevin A.; Bonawitz, Elizabeth; Ullman, Tomer D. (January 2022, Cognitive Neuropsychology)

Full Text Available
Rethink reporting of evaluation results in AI

https://doi.org/10.1126/science.adf6369

Burnell, Ryan; Schellaert, Wout; Burden, John; Ullman, Tomer D.; Martinez-Plumed, Fernando; Tenenbaum, Joshua B.; Rutar, Danaja; Cheke, Lucy G.; Sohl-Dickstein, Jascha; Mitchell, Melanie; et al (April 2023, Science)

Artificial intelligence (AI) systems have begun to be deployed in high-stakes contexts, including autonomous driving and medical diagnosis. In contexts such as these, the consequences of system failures can be devastating. It is therefore vital that researchers and policy-makers have a full understanding of the capabilities and weaknesses of AI systems so that they can make informed decisions about where these systems are safe to use and how they might be improved. Unfortunately, current approaches to AI evaluation make it exceedingly difficult to build such an understanding, for two key reasons. First, aggregate metrics make it hard to predict how a system will perform in a particular situation. Second, the instance-by-instance evaluation results that could be used to unpack these aggregate metrics are rarely made available ( 1 ). Here, we propose a path forward in which results are presented in more nuanced ways and instance-by-instance evaluation results are made publicly available.
more » « less
Full Text Available

Search for: All records